Linking Natural Language Processing and Biology: Towards Deeper Biological Literature Analysis
ثبت نشده
چکیده
Most current definitional question answering systems apply one-size-fits-all lexicosyntactic patterns to identify definitions. By analyzing a large set of online definitions, this study shows that the semantic types of definienda constrain both lexical semantics and lexicosyntactic patterns of the definientia. For example, “heart” has the semantic type [Body Part, Organ, or Organ Component] and its definition (e.g., “heart locates between the lungs”) incorporates semantic-typedependent lexicosyntactic patterns (e.g., “TERM locates ...”) and terms (e.g., “lung” has the same semantic type [Body Part, Organ, or Organ Component]). In contrast, “AIDS” has a different semantic type [Disease or Syndrome]; its definition (e.g., “An infectious disease caused by human immunodeficiency virus”) consists of different lexicosyntactic patterns (e.g., “...causes by...”) and terms (e.g., “infectious disease” has the semantic type [Disease or Syndrome]). The semantic types are defined in the widely used biomedical knowledge resource, the Unified Medical Language System (UMLS).
منابع مشابه
Towards Incorporating Scientific Literature into Biological Algorithms
The Tutorial This tutorial is a practical introduction on applying natural language processing (NLP) to biological research. It focuses on basic methodology used in current research efforts both in the published literature and in our lab. This tutorial exposes attendees to a broad range of work in the field, and at the end they will understand the technologies applied to solve NLP problems in b...
متن کاملLinking Brain Biology to Intellectual Endowment: A Review on the Associations of Human Intelligence With Neuroimaging Data
Human intelligence has always been a fascinating subject for scientists. Since the inception of Spearman’s general intelligence in the early 1900s, there has been significant progress towards characterizing different aspects of intelligence and its relationship with structural and functional features of the brain. In recent years, the invention of sophisticated brain imaging devices using Diffu...
متن کاملTowards deep reasoning with respect to natural language text in scientific domains
In this paper we take some initial steps towards deep reasoning with respect to natural language text in scientific domains. In particular we consider answering Why and How questions with respect to a high school Biology text. In comparison to other kinds of questions, Why and How questions (the later referred to as procedural questions by some) have been less explored in the Question Answering...
متن کاملBioLexicon: A Lexical Resource for the Biology Domain
Natural language processing technologies have advanced remarkably in the past two decades. However, biological terminology is a frequent cause of analysis errors when processing literature written in the biology domain. The BOOTStrep BioLexicon is a linguistic resource tailored for the domain to cope with these problems. It contains the following types of entries: (1) a set of terminological ve...
متن کاملGENIES: a natural-language processing system for the extraction of molecular pathways from journal articles
Systems that extract structured information from natural language passages have been highly successful in specialized domains. The time is opportune for developing analogous applications for molecular biology and genomics. We present a system, GENIES, that extracts and structures information about cellular pathways from the biological literature in accordance with a knowledge model that we deve...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006